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Originally isolated as a result of its ability to interact with tlie movement protein of Tomato 
spotted wilt virus in a yeast two-liybrid system, the 4/1 protein is proving to be an excellent 
tool for studying intracellular protein trafficking and intercellular communication. Expression 
of 4/1 in vivo is tightly regulated, first appearing in the veins of the cotyledon and later in 
the vasculature of the leaf and stem in association with the xylem parenchyma and phloem 
parenchyma. Structural studies indicate that 4/1 proteins contain as many as five coiled-coil 
(CO domains; indeed, the highest level of sequence identity among 4/1 proteins involves 
their C-terminal CC domains, suggesting that protein-protein interaction is important for 
biological function. Recent data predict that the tertiary structure of this C-terminal CC 
domain is strikingly similar to that of yeast protein She2p; furthermore, like She2p, 4/1 
protein exhibits RNA-binding activity, and mutational analysis has shown that the C-terminal 
CC domain is responsible for RNA binding. The 4/1 protein contains a nuclear export signal. 
Additional microscopy studies involving leptomycin and computer prediction suggest the 
presence of a nuclear localization signal as well. 
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INTRODUCTION 

Identification of host proteins involved in virus-host interactions 
has long been an area of intense interest for molecular virologists. 
Movement proteins (MPs) encoded by plant viruses were among 
the first viral proteins shown to have affinity for host components 
in experiments in vivo and in vitro (Citovsky et al., 1993; Wittmann 
etal, 1997; Dorokhov etal., 1999; Lazarowitz and Beachy, 2000; 
Harries etal, 2010). 

To identify^ host factors that may play important roles in virus- 
specific processes, Araijidopsis tlraliana and Nicotiana hentiiamiana 
cDNA libraries were screened in yeast two-hybrid system with 
NSm, the tubule-forming MP of Tomato spotted wilt tospovirus 
(TSWV), as bait (Soellick etal, 2000; von Bargen etal, 2001; 
Schoelz etal., 2011). Among the potentially interacting factors 
identified were a DnaJ-like co-chaperon (Soellick etal., 2000) 
and At-4/1, a previously uncharacterized protein showing some 
similarity to alpha-helical domains of myosin-, kinesin-, and 
ankyrin-like proteins (von Bargen et al., 200 1 ) . Although any func- 
tional relevance of At-4/1 for TSWV infection remains uncovered, 
subsequent studies have shown the 4/1 protein to be an excellent 
tool for studying intracellular protein trafficking and intercellular 
communication (Paape etal., 2006; Solovyev etal, 2013a). 

EVOLUTION OF THE 4/1 GENE 

4/1 genes encoded by the A. tiialiana and N. tabacum genomes 
all contain eight exons and seven introns (Paape etal., 2006; 



Makarova et al., 20 1 1 ). A similar exon-intron structure was found 
for most 4/1 genes encoded by other dicotyledonous and mono- 
cotyledonous plants. One remarkable exception was found in the 
order Rosales, where the 4/1 genes of all representatives sequenced 
(Prunus persica, Malus domestica, and Fragaria vesca) have lost 
introns 3 and 4 and thus contain only six exons (Makarova et al., 
2011). Figure lA shows an amino acid sequence alignment of 
the most conserved C-terminal region encoded by exon 8, which 
is predicted to form a coUed-coU (CC) structure (see below), 
for some of these proteins. In addition to flowering plants, 4/1 
proteins are also found in gymnosperms (Coniferophyta and 
Ginkgophyta), ferns (Polypodiopsida, Ceratopteris richardii) and 
Lycopodiophyta (Selaginella moellendorfii; Makarova etal., 2011 
and data not shown). It should be noted that in all known cases 
4/1 represents a single-copy gene with an exception of S. moellen- 
dorfii and N. tabacum having two 4/ 1 gene copies due to polyploidy 
(Makarova etal, 2011). 

Notably, we found clear evidence for the presence of a 4/1 -like 
gene in Marchantiophyta (liverwort Marchantia polymorpha) that 
occupies a basal position among land plants (Bowman, 2013). The 
organization and encoded protein sequences of this putative ances- 
tral 4/1 gene are similar to the five 3'-terminal exon-introns of the 
tobacco 4/1 gene (Figure lA and our unpublished data). More- 
over, our analysis of recently sequenced charophycean algae cDNA 
libraries (Timme etal., 2012) suggests that the typical C-terminal 
4/1 -like CC domains first appeared in orders Zygnematales 
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1 - FISRTJCLVEYQQDSTSSQHMHTLKQKIMKLRKENEVLICRQIHERKEF Niootiana tabaoum 

2 - FISRTNMVEYQQDLTSSKHMHTLKQKIMKLRKENEILICRQILERSSN Physalis peruviana 

3 - LISRTKLVEYQQDSTSSCHMHTLKQKIMKLRKENEVLItRQILERNSS Petunia axillaris 

4 - LLSRTKLREHQRDSTSNQUVETLKQKIMKLQNENEVLKRTLANSKES Olea europaea 

5 - LISRTKIJlENQLDLTSNQHaETLKQKIMKLREENEVLKRRLNDENC|l Catharanthus roseus 

6 - IISRTKLBEHHRDSTSNRHVETLKQKIMKLRKENEVLKRQVHGSKEG Vaccinium corimbosum 

7 - LISRTKLLEHQRDSTSSRNVETLKQKIMKLRKENEVLKRQLSVl™s Panax ginseng 

8 - lisktklgenkqdlsssr|cetlkqkvmtlrkenevi,krqlldqned Lactuca serrlola 

9 - SRTKLGEVENKQDMSSNRHVETLKQKVMKLRKENEVLKRQLLDQNED Tragopogon dubius 

10- LISKTKLGENKQDLSSSeJcETLKQKVMTLRKENEVLKRQLLDQNED Centaurea maculosa 

11- LINRTKLflENTQDLSSNEHVDTLKQKVMKLRKENEVLKRQIJiDSE?EGK flmbrosia trifida 

12- LISKTKLAENKQDLSSSRHVETLKQKVMTLRKENEVLKRRLLGSKED Centaurea diffusa 

13- MMNRTKMfiEHNBDLTSVRSVETLKLKIMKLRKENEILKRKLNSSSCJEG Citrus Clementina 

14- LISRTKLAEQHRDLTSVRNIETLKLKIMKLRKENEILKRKLSSSAEG Azadirachta indica 

15- LISRTKLAEQQQDSASSRQVETLKLKIMKLRKENEILKRKLSSLDDG Theobroma cacao 

16- LISRTKLGEQQQDSASNQQVETLKLKIMILRKENEILKRKLSSMEAC Gossypium hirsutum 
11- LISRTKLVEQQRDLASNLQVESLKLKLMKLRKENEILRRKISCSAEN Carica papaya 

18- LISRTSIflEKQRDTTSNRQVETLKQKLMKLRKENEILKRKLSSS Arabidopsis thaliana 

19- VISRTTVAEKQRDTTSYRQVETLKQKLMKLRKENEILKRKLSSS Brassica rapa 

20- LVSRTNIVEKQBDTTSNRQVETLKQKI*IKLRKENEl| Cardamine impatiens 

21- VLSQTITVETQSDRNSNEQVETLKQKLMKLRKENEILKRQ|SSSENS Aethionema arabicum 

22- LISRTKLVGQQBDAVSSRLVETLKLKIMKLRKENEILRRQLiCSE Quercus alba 

23- LVSRSKLVEQQRDSNSIWQVETLKTKIMKLRKENEILKRKf|hSDEG Prunus persica 

24- LVSRSKLVEQKQDSNSTWQVEMLKTKIMKLRKENEILKRKIL|sDKG Malus domestica 

25- LVSRSKLVQQQGDSNSNWQVETLKGKLMKLRKENEILKRKLHSVEG Fragaria vesca 
2 6- WICRSKIVKQQ|nltSDAHNETLKQKIVELRKEKKTLKRKLVKIERE Cannabis sativa 
21- LICKGKLVDQQQEMTANWHVETLKQKIMKLRKENEVLKRKLSHSEEGM Phaseolus vulgaris 
28- LVIAKRWDLQQEMTSNEQVETLKQKIMKLRKENEVLKRKLSHSQEVK Arachis duranensis 
2 9- LLFKIKMVDQQ2EMT|nWQVETLKQKIMKLRKENEVI,KRKFSHIDDGIQNQL Cicer arietinum 

30- LLFKAKMUDQQQEMTSNWQVETLKQKIMKLRKENEVLKRKFSHIEEGK Medicago trunoatula 

31- LICKAKLVDQQQEMTSNMHVETLKQKIMKLRKENEVLKRKFSHSQESK Glycine max 

32- LISKTKRVEQQQDLASSRLVETLKQKIMKLRKENEILKRKLSHSEEG Populus trichocarpa 

33- MISRTKLVEQKQDLSSNEFVETLKAKIMKLRKENEILKRKLSHSVEGLE Ricinus communis 

34- LVIRTKLVEQQQDMASHRLVETLKSKIMKLRKENEILKRKLSHSEEA Jatropha curcas 

35- VISKTKLVEQQQDLASNRLVETLKAKIMKLRKENEILRRKLSHSEEG Manihot esculenta 

36- LLSKTRQVEQQRDFTSNQHVETLKVKIMKLRKENEILKRKLSQSHSEEG Euonymus alatus 
31- LITRTKFAQEQRDSATNRQVETLKQKVMNLREENEILKRKL|aseEG Eucalyptus grandis 

38- LISRTKLVEQQRDLASNRHVDTLKQKIMKLRKENEVLKRKVHDAKEG Vitis vinifera 

39- LVSKAKTVD-QHFSSSDKQLDMLKQKMMKLRKENEVLICRQMLAFES Beta vulgaris 

40- LVARTKMTD-NHVSTSDRQLEMMKLKMMKLRRENEALRRQVHVSQEG Silene vulgaris 

41- IISRAKLVEKHgDSTLNHHIETLKQKVMKLRKENEVLKRKLLVLEG Nelumbo nucifera 

42- ENSCKKLADCQRDSTSNRHVETLKQKIMKLRKENEVLKRQLNHIKDRY Aquilegia formosa 

43- LVSRKKLEEQQRDSTSNEHVETLKQKIMKLRKENEALKRQlI Papaver somniferum 

44- LIIKKKQINQQRDSISLHHVETLKQKIMKLRKENESLKRQVTRI.KT Musa acuminata 

45- RITEMKQAEQQRDSVSV2HVETLKQKVMKLRKENESLKRRLASSVA Sorghum bicolor 

46- RMMBMKQAEQQRDSISVQKVETLKQKVUKLRKENESLKRRLASSEA Setaria italica 
il- RITRMKQAEQQRDSVSVQHVETLKQKVMRLRKENESLKRRLASSEV Zea mays 

48- RITRMKQAEQQHDSVSVQHVETLKQKVMKLRKENESLKRRLASCEA Saccharum officinarum 

49- RVLKMKQAEQQRDSVSVQHVETLKHRASNLLKITDSHKRRQETSKFEFS Brachypodium distachyon 

50- RIMKMKQAEQQRDSISVQHVETLKQKVMKFRKENESLKRRLAISELDCS Oryza sativa 

51- RILKMKQADQQRDTVSICHVESLKQKVMKLRKENESLKRRLASSELDCS Hordeum vulgare 

52- MMTWSKQAEHQRESISIRHVETLKQKVMKLRRENEALJKRLSGTIGSDLQHL Spirodela polyrhiza 

53- QHLRVKQTERQQESISIRHVETLKQKVMKLRKENENLKRQLASK Zostera marina 

54- LACKTEKEEQHKESASNHHVNILKQKIMKLRKENELLRRQLQE|nSAD| Aristolochia fimbriata 

55- QNAEQACKDQLAGSTSNRNWILKQKIMKLRKENEELKRKLNTVQSEEQ Amborella trichopoda 

56- LVNKMKYEEQIRDSSSTRHIDMFKQKLMKLRKENEDLKRQLL|TRQ Pinus taeda 
51- LINKMKYEEQLRDSS|NRYIDMFKQKLMKLRKENEGLRWGGKAHRG Taxus baccata 

58- SINRMKYEEHLRDLSSNRHVDMFKQKLMKLRKENEDLKRQLLASRR Ginkgo biloba 

59- laekadlekrlkessfsrhsevykqkimslrrenedlrrqlrgatqnekyvs Ceratopteris riohardii 

60- LLEKTKLEAQIQELSSNWQVGALKQKn^MKLRKENEDLKRRR Selaginella moellendorf ii 

61- MVSSVNHDDQLRELSANRQADIFKQKVLKLRKDHEVLRRQMQQNLK Marchantia polymorpha 

62- AASQLKLEEQLQESKSHRHSDILKTRI.LKLRKENEELKRAVRAAGLSIS Fenium margaritaceum 

63- LTMKMKLEERLQG|tTGRQLDVFKQKILKLRKENEELKRLMQRS Chaetosphaeridium globosum 




FIGURE 1 I Continued 
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FIGURE 1 I Continued 

(A) Multiple amino acid sequence alignment of the C-terminal domains in 
4/1 proteins. The conserved hydrophobic residues forming heptads 
(elements of potential leucine zippers) in 4/1 proteins are shown by (+) 
above the Nt-4/1 sequence. Residues replaced by alanines in Nt-4/1-KQK are 
marked with (*). Negatively charged residues are shown in blue, histidine 
residues are shaded in green, positively charged residues are shown in 
yellow, and prolines are shown in red. Taxonomic positions of the numbered 
plant species are as follows: 1-3 - subclass asterids, order Solanales; 4 - 
order Lamiales; 5 - order Gentianales; 6 - order Ericales; 7 - order Apiales; 
8-12 - order Asterales; 13-14 - subclass rosids, order Sapindales; 15-16 - 
order Malvales; 17-21 - order Brassicales; 22 - order Fagales; 23-26 - order 
Rosales; 27-31 - order Fabales; 32-35 - order Malpighiales; 36 - order 
Celastrales; 37 - order Myrtales; 38 - order Vitales; 39-40 - order 
Caryophyllales; 41 - order Proteales; 42-43 - order Ranunculales; 44 - 
class Liliopsida, order Zingiberales; 45-51 - order Poales; 52-53 - order 
Alismatales; 54 - order Piperales; 55 - order Amborellales; 56-57 - class 
Coniferopsida, order Coniferales; 58 - class Ginkgoopsida, order 
Ginkgoales; 59 - class Polypodiopsida, order Polypodiales; 60 - class 
Isoetopsida, order Selaginellales; 61 - class Marchantiopsida, order 
Marchantiales; 62 - class Zygnemophyceae, order Desmidiales; 63 - class 
Coleochaetophyceae, order Coleochaetales. Please note that some 
species belonging to eudicotyledons (numbers 39-43) are not included into 
subclasses asterids and rosids; monocotyledons (class Liliopsida) are 
numbered as 44-53; plant species numbered as 54 and 55 are basal 
flowering plants. Species 62 and 63 represent microscopic algae. (B) Tissue- 
specific activity of the Nt-4/1 promoter in tobacco flowers. Promoter activity 
was analyzed in transgenic Nicotiana tabacum cv. Samsun plants carrying 
the GUS gene under the transcriptional control of either the 35S promoter 
used as a control (left panel), or a DNA fragment comprising 2000 base 
pairs upstream of the Nt-4/1 gene transcription start site (right panel). 



(Penium margaritaceum) and Coleochaetales (Chaetosphaeridium 
globosum; Figure lA). These algal orders are believed to represent 
the closest sister group to all land plants (Turmel etal., 2013). In 
contrast, no encoded 4/1 -like protein signatures were found in 
moss Physcomitrella patens. 

TISSUE- AND ORGAN-SPECIFIC EXPRESSION OF 4/1 PROTEIN 

To analyze the tissue specificity and developmental regulation 
of Nt-4/1 gene expression, we generated transgenic tobacco and 
N. benthamiana plants carrying the GUS reporter gene under 
the control of Nt-4/1 promoter (Solovyev etal., 2013b). Our 
analyses revealed that 4/1 promoter-driven expression was first 
observed in cotyledons in association with veins. Later, at the 
two- to four-leaf stage, GUS staining was found in veins in 
the blades of first two foliage leaves as well as in cotyledon 
veins. As the plants continued to grow, GUS staining disap- 
peared from the cotyledons and was found in association with 
veins of younger leaves and stem internodes, with the hypocotyl 
being the most intensively stained. Studies of tissue- and cell- 
specific expression in these organs showed that in larger veins 
showing secondary growth, staining was associated mostly with 
the xylem parenchyma and phloem parenchyma. In young veins 
without secondary growth staining was limited to primary phloem 
cells. Stems and mature leaves showed GUS expression in vas- 
cular bundles, mostly in xylem parenchyma (Solovyev etal., 
2013b). 

Edwards et al. (2010) have used AffymetrLs GeneChip arrays to 
study tobacco gene expression from seed germination to senes- 
cence. Among the 19 different tobacco organ samples tested, 4/1 
gene expression was highest in vegetative and floral shoot apices. 



floral buds, and juvenile leaves. Somewhat lower levels of expres- 
sion were observed in stem internodes, seeds, and cotyledons 
(Edwards etal., 2010). Thus, the Affymetrix data sets for tobacco 
are in good agreement with the results of our GUS expression 
experiments. 

Detailed information on the organ-, or tissue-dependent 
expression pattern of plant genes can be retrieved from sev- 
eral public domain microarray databases, e.g., AtGenExpress at 
http://www.arabidopsis.org, PLEXdb at http://www.plexdb.org, 
eFP browser at http://bar.utoronto.ca and NCBI GEO at 
http://www.ncbi.nlm.nih.gov/geo. In poplar, 4/1 expression was 
very high in differentiating xylem (NCBI GEO GSE30507, 
GSE25309, GSE25304; Wilkins etal., 2009). In growing tips 
of maize leaves, the highest levels of 4/1 mRNA were 
present in the bundle sheath of developing veins (Li etal., 

2010) , whereas according to the Genevestigator database 
(https://www.genevestigator.com) significant expression levels 
were observed in the phloem. Second, similar to the pattern 
observed in tobacco, extremely high levels of 4/1 mRNA were 
found in the shoot apex (especially the apical meristem) of A. 
thaliana (Schmid etal., 2005), soybean (Libault etal., 2010), and 
rice (Jain etal., 2007). Third, a significant increase in 4/1 mRNA 
expression was observed at certain stages of seed formation in 
A. thaliana (most prominently in chalazal endosperm; Geneves- 
tigator and http://seedgenenetwork.net/arabidopsis), Medicago 
truncatula (He etal., 2009), soybean (Severin etal., 2010), and 
rice (Jain etal, 2007). These findings indicate that in addi- 
tion to the conservation of gene structure and sequence of the 
encoded proteins, the patterns of 4/1 gene expression in evo- 
lutionary distant species are quite similar. Notably, according 
to the Genevestigator database the highest levels of 4/1 mRNA 
were present in pollen of A. thaliana (gene At4g26020) and in 
anther/pollen of Oryza sativa (gene Os02g0 148600). In agreement 
with these data, we found 4/1 promoter activity in anthers, but 
not other parts of N. tabacum flowers, whereas the 35S promoter 
used as a control was active in pistils, anthers and petal veins 
(Figure IB). 

PHYSICO-CHEMICAL PROPERTIES OF 4/1 PROTEIN 

The Nt-4/1 protein is predicted by PSSFinder (Soh etal, 2009) 
(http://linuxl.softberry.com/berry.phtml) to form six long alpha- 
helices and two short beta-sheet structures. PCOIL (Lupas, 1996; 
Gruber etal, 2006; http://tooLkit.tuebingen.mpg.de/pcoils) and 
MARCOIL (Delorenzi and Speed, 2002; http://www.isrec.isb- 
sib.ch/webmarcoil/webmarcoUCl.html) predicted the likely pres- 
ence of five CC domains in the alpha-helical regions (positions 
44-71, 115-131, 139-167, 176-209, and 223-243; Makarova etaL, 

2011) . Similar secondary structure elements were also predicted 
for other 4/1 proteins (see, for example Figure lA). In many pro- 
teins, CC domains regulate homo- and hetero-multimerization 
important for the protein function (Gordon etal., 2009; Wang 
and Coluccio, 2010). Interestingly, the highest level of sequence 
identity among all 4/1 proteins is found between the C-terminal 
CC domains (Paape etal, 2006; Figure lA). 

To experimentally analyse the secondary structure of the Nt- 
4/1 protein, the circular dichroism spectra in the far UV region 
were measured for wild type and several mutant Nt-4/1 proteins. 
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The a-helix content in these polypeptides was estimated using 
the Greenfield-Fasman equation (Greenfield and Fasman, 1969). 
This calculation gave a value of 43% for the wild type protein 
(Makarova etal., 2011). Partial proteolysis data and differential 
scanning calorimetry (DSC) indicated that the Nt-4/1 protein 
consists of three structural domains: an N-terminal domain with a 
relatively significant level of disorder, a more stable central domain 
capable of interacting with the N-terminal part and the most stable 
C-terminal region that forms the 5th CC domain represented in 
Figure lA. 

INTERACTION WITH NUCLEIC ACIDS 

To further investigate the spatial organization of 4/1 protein, we 
used a new algorithm, QUARK (http://zhanglab.ccmb.med.umich. 
edu/QUARK), developed for ab initio protein structure prediction. 
For a query protein sequence, this algorithm first predicts a variety 
of selected structural features using a neural network. The global 
fold is then generated by replica- exchange Monte Carlo simula- 
tions which assemble the small fragments as generated by gapless 
threading through the template library (Xu and Zhang, 2012). 
Because of a length limit of 199 residues for the query sequence, 
we used QUARK to compare the organization of the shortest 4/1 
protein from M. polymorpha (196 amino acids) and an Nt-4/1 
deletion mutant lacking the N-terminal 90 amino acids with two 
predicted alpha helices (Figure 2A). Both proteins exhibited very 
similar tertiary structures, mainly an alpha up-down bundle fold. 

Next, we used the COFACTOR webserver (Roy etal, 2012) 
(http://zhanglab.ccmb.med.umich.edu/COFACTOR) for auto- 
mated structure-based protein function annotation. Starting from 
a structural model, derived from either experimental determina- 
tion or computational modeling, COFACTOR identifies template 
proteins of similar folds. Quite unexpectedly, 4/1 proteins showed 
the greatest similarity in their folding to yeast protein She2p (246 
amino acids in length). This protein is a member of a class of 
nucleic acid binding proteins that contain a single globular domain 
with a five alpha helix bundle and form a symmetric homodimer 
(Niessing et al, 2004; Figure 2A). In yeast, this protein is involved 
in directional transport and localization of a specific mRNA (ASHl 
mRNA) that regulates gene expression on a temporal and spatial 
level (Jansen and Niessing, 2012). During mitosis in Saccharomyces 
cerevisiae, ASHl mRNA is transported from the mother cell to 
the daughter cell as part of a large mRNP which contains She2p. 
ASHl mRNA contains four cw-acting hairpin regions, termed zip- 
code elements, that mediate mRNA binding to She2p as well as its 
subsequent directed intracellular transport (Jansen and Niessing, 
2012). 

Because we had previously observed homo-oligomerization of 
At-4/1 in the yeast two-hybrid assay (Minina et al., 2009), we were 
interested to determine whether or not Nt-4/1 was also capable of 
forming oligomers. Preparations of recombinant Nt-4/ 1 were ana- 
lyzed by dynamic laser light scattering, a method that measures the 
size distribution of small particles dispersed in solution, and elec- 
tron microscopy. The resulting data indicated that Nt-4/1 exists 
in solution predominantly as oligomers and multimers (Solovyev 
etal, 2013a). 

To experimentally investigate possible functional similarities 
between She2p and 4/1 proteins, the ability of Nt-4/1 protein to 



bind various forms of RNA was investigated using a combination 
of agarose gel retardation and Northwestern assays. First, Nt- 
4/1 was shown to bind single-stranded green fluorescent protein 
(GFP) transcripts much more weakly than double-stranded RNA 
obtained by annealing with a complementary RNA chain (data not 
shown). Interestingly, Potato spindle tuber viroid (PSTVd; Gross 
etal, 1978) which is an imperfect dsRNA (thus showing a simi- 
larity to zip-code elements in ASHl mRNA) binds Nt-4/1 more 
strongly than non-specific GFP dsRNA (Figure 2B). Likewise, par- 
tially double-stranded tRNA interacts with Nt-4/1 significantly 
better than non-specific GFP dsRNA (Figure 2B). Deletion of the 
C-terminal CC domain in Nt-4/1 mutant ACCV blocked viroid 
binding (to be presented eslewhere). These data suggest that the 
positively charged and highly conserved C-terminal region of 4/1 
(Figure 1 A) plays an important role in RNA binding. To verify this 
hypothesis, we constructed a Nt-4/1 mutant called Nt-4/l-KQK 
in which sis amino acid residues in the C-terminal CC region 
including three Lys and one Arg residue were replaced with Ala 
residues (Figure lA). Similar to Nt-4/l-ACCV, Nt-4/l-KQK was 
unable to interact with PSTVd (Figure 2C) . Collectively, the struc- 
tural predictions and data on protein multimerization and RNA 
binding indicate that Nt-4/1 protein and yeast She2p share several 
common features, but further research is required to determine 
whether or not these similarities reflect a general mode of action 
involving binding of imperfect RNA duplexes. It is tempting to 
speculate that 4/1 protein, similarly to She 2p, might be involved 
in addressed transport of RNA molecules to specific subcellular 
compartments in plants. Taking into account the fact that the 4/1 
expression is associated with the vascular tissues, it seems likely 
that this protein is involved in the long-distance RNA transport 
through the phloem. 

Indeed, we have recently reported evidence for a possible role 
of 4/1 protein in the long-distance vascular movement of PSTVd 
(Solovyev et al, 2013a). When PSTVd was inoculated onto N. ben- 
thamiana plants where the level of endogenous 4/1 mRNA was 
down-regulated by virus-induced gene silencing (VIGS), long- 
distance movement of the viroid into developing young leaves 
above the inoculated leaf was much more efficient than in compa- 
rable unsilenced control plants (Solovyev et al., 2013a). These data 
raise another important question for further studies, namely, does 
the interaction of 4/1 with the viroid contributes to long-distance 
pathogen movement or, alternatively, to its restriction? 

SUBCELLULAR LOCALIZATION OF 4/1 PROTEIN 

Thus far we have generated two sets of experimental data con- 
cerning the subcellular localization of 4/1 protein. First, At-4/1 
fused at its C-terminus to GFP, and the corresponding expres- 
sion vectors were used in particle bombardment assays (Paape 
et al., 2006). Confocal laser scanning microscopy (CLSM) revealed 
that a majority of At-4/1 assembled in punctate bodies at the cell 
periphery. For simplicity, the 4/1 -specific structures were called 
'FO bodies' (four/one bodies; Solovyev etal., 2013a). A striking 
feature of At-4/ 1 -GFP was the polarized distribution of FO bodies 
in epidermal cells of A. thaliana and N. benthamiana leaves. The 
protein was localized predominantly in one-half of an epidermal 
cell. The molecular basis for this peculiar subcellular distribu- 
tion is currently unknown, but among several signaling pathways 
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FIGURE 2 1 (A) QUARK predictions of 4/1 structure. Predictions for 
the M. polymorpha 4/1 protein possessing four alpha helices (left 
panel) and an Nt-4/1 segment including four C-terminal alpha helices 
(right panel) are shown. Numbers indicate positions of alpha helices 
in the entire proteins. Graphical presentation of predicted structures 
was performed with the UCSF Chimera package (Pettersen etal., 




+ Nt-4/1 



10:1 20:1 40:1 80:1 160:1 240:1 



dsRNA 
tRNA 




+ Nt-4/1 -KQK 



20:1 50:1 80:1 100:1 150:1 200:1 





PSTVd 



2004). (B) Gel retardation analysis of the competitive ability of 
Nt-4/1 to binding viroid RNA, tRNA and dsGFP RNA. Increasing 
amounts of Nt-4/1 (molar ratios of proteiniRNA ratios are indicated 
above each lane) were incubated with RNA prior to gel loading. 
(C) Comparison of viroid RNA binding by Nt-4/1 and the mutant 
Nt-4/1-KQK (see Figure 1A). 



and networks that regulate protein intracellular polarity in plants 
(Yang, 2008) the PIN auxin efflux carriers (Barbez and Kleine- 
Vehn, 2013; Moschou et al., 2013) seem the most likely candidates 
for future comparative studies with 4/1. 

In many cases, the At-4/1 FO bodies located at the cellular 
periphery appeared as twin structures consisting of two discon- 
nected bodies. Superposition of confocal and bright field images 
showed that twin FO bodies are located on opposite sides of 
the cell walls separating adjacent leaf epidermal cells. Twin-body 



formation may reflect the association of FO bodies at opposite 
neck regions of a particular PD or within the same pit field (Paape 
et al., 2006). Because the At-4/l-GFP fusion protein was expressed 
via particle bombardment of single isolated cells, observations of 
twin bodies in these experiments implies that At-4/1 is able to 
move through plasmodesmata between cells (Paape etal., 2006). 
In the second series of experiments, the subcellular localization 
of Nt-4/1 GFP fusion protein was analyzed by agroinfiltration- 
mediated transient expression (Solovyev etal, 2013a). Although 
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most of the Nt-4/1 FO bodies behaved as stationary structures, 
some of them moved within the cell. This intracellular movement 
of FO bodies depended on the actin-myosin system. Cytochalasin 
D, known to disassemble microfilaments and interfere with the 
actin-dependent protein trafficking in plant cells, blocked move- 
ment of these FO bodies. When we co-expressed Nt-4/1 -GFP and 
YFP-talin, a marker for the actin cytoskeleton, most of the FO 
bodies co-aligned with the actin microfilaments (Solovyev etal., 
2013a). Interestingly, yeast two-hybrid analysis revealed that At- 
4/1 is also capable of binding plant actin 7 (Minina et al., 2009 and 
unpublished data). 

Previously, we have hypothesized that the Nt-4/1 protein is 
capable of shuttling between the nucleus and cytoplasm (Minina 
etal., 2009). To test this hypothesis, cells expressing Nt-4/1- 
GFP were treated with leptomycin B, a drug known to suppress 
the function of CRMl/exportin 1, a carrier protein involved 
in nuclear export of proteins containing a nuclear export sig- 
nal (NFS). In this case, granular fluorescent structures were 
observed within the nuclei as well as the cytoplasm of leptomycin 
B-treated cells (Solovyev et al., 2013a). The ability of Nt-4/1 pro- 
tein to localize both to the nucleus and the cytoplasm as well 
as the effect of leptomycin B on its localization implies that 
this protein possesses both a nuclear localization signal (NLS) 
and a NES. Indeed, mutational analysis of Nt-4/1 has allowed 
us to identify a single NES (residues 196-205 of the Nt-4/1 
sequence). A search for non-canonical NLSs in Nt-4/1 protein 
using "cNLS Mapper" (http://nls-mapper.iab.keio.ac.jp) predicted 
three medium-score NLSs located at residues 40-68 (NLSl), 
150-183 (NLS2), and 202-233 (NLS3) of the Nt-4/1 sequence 
(Solovyev et al., 2013a). 

Mutation of the NES (Nt-4/1 CCII mutant) resulted in accu- 
mulation of Nt-4/1 within the nucleoplasm. The mutant protein 
accumulated as large spherical bodies as much as 4 nm in diam- 
eter. The majority of nuclei observed contained either one or 
two Nt-4/1 -CCII bodies. Single optical sections obtained by 
CLSM demonstrated that Nt-4/1 -CCII-GFP was localized only 
at the periphery of the spherical structures. Immunogold label- 
ing confirmed that Nt-4/1 -CCII-GFP was localized only to the 
peripheral electron-dense layer of the nuclear bodies. The internal 
regions contained moderately electron-dense material, suggest- 
ing that proteins other than Nt-4/1 -CCII-GFP could be located 
in the interior of the spherical structures. The question remains 
how do (and if) potential accessory proteins influence forma- 
tion of large nuclear bodies in Nt-4/ 1 NES mutant? To this end, 
experiments to isolate the large N. benthamiana nuclear 4/1 bod- 
ies and determine their molecular composition are currently in 
progress. 

FUTURE PERSPECTIVES 

In this brief review, we have attempted to provide an overall pic- 
ture of what is currently known about 4/1 protein. However, as 
additional evidence continues to accumulate, new and compelling 
questions arise, questions that must be answered in the next years 
if we are to have a more complete and realistic view of the prop- 
erties of 4/1. First, comparison of the varying patterns of 4/1 gene 
expression (or complete lack thereof) in different plant species 
may help to reconstruct the molecular networks where this gene 



is involved. To this end, we plan to create N. benthamiana All 
knock-down plants and, additionally, potato and tomato plants 
expressing heterologous 4/1 proteins. To our surprise BLAST anal- 
ysis of sequence data from solgenomics.net indicates that both 
these crops (like tobacco, members of the Solanaceae) appear to 
lack 4/1 genes. This notion can be due to the incompleteness of 
available genome sequences. However, potato and tomato are nat- 
ural hosts for PSTVd. This may correlate with both the lack of 
4/1 in these plant and enhanced PSTVd long-distance movement 
in 4/1-silenced N. benthamiana (Solovyev etal., 2013b). Second, 
in the last two decades, many viral MP determinants that are 
involved in systemic invasion of plants have been identified and 
characterized. The current state of the art directs us toward the 
following questions: how are MP-4/1 complexes formed? Once 
formed, how are these complexes distributed in the plant cell? 
Given the unanticipated diversity of MP types and assuming that 
4/1 protein interacts with the tubule-forming MPs of nepo- as well 
as tospoviruses (von Bargen, personal communication), it wiU be 
necessary to study the molecular dynamics of complex formation 
in a variety of MP contexts to unveil the spectrum of different 
mechanisms regulating MP activities after 4/1 binding. 
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